Optimal Sensor Scheduling via Non-standard Multi-armed Bandit Formulation
نویسندگان
چکیده
This paper addresses optimal sensor scheduling for maximizing network lifetime. We formulate this problem as a nonstandard multi-armed bandit process with non-discounted reward and finite stopping time. We find that the optimal strategy should choose the sensor that has the largest Gittins’ index. Exploiting the underlying structure of sensor scheduling problem, we reduce the computational complex of the Gittins’ index from O(N) to O(N) with respect to the network size N . Moreover, we derive closed-form expression for the Gittins’ index. We also show that choosing the sensor with the most residual energy is an optimal strategy when the channel fading is independently and identically distributed across sensors.
منابع مشابه
Finite dimensional algorithms for the hidden Markov model multi-armed bandit problem
The multi-arm bandit problem is widely used in scheduling of traffic in broadband networks, manufacturing systems and robotics. This paper presents a finite dimensional optimal solution to the multi-arm bandit problem for Hidden Markov Models. The key to solving any multi-arm bandit problem is to compute the Gittins index. In this paper a finite dimensional algorithm is presented which exactly ...
متن کاملSensor Scheduling for Multi-parameter Estimation under an Energy Constraint
We consider a sensor scheduling problem for estimating multiple independent Gaussian random variables under an energy constraint. The sensor measurements are described by a linear observation model; the observation noise is assumed to be Gaussian. We formulate this problem as a stochastic sequential allocation problem. Due to the Gaussian assumption and the linear observation model, this proble...
متن کاملBayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem
In this paper we investigate human exploration/exploitation behavior in sequential-decision making tasks. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief ...
متن کاملBayesian and Approximate Bayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem
In this paper we investigate human exploration/exploitation behavior in sequential-decision making tasks. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief ...
متن کاملBayesian and Approximate Bayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem
In this paper we investigate human exploration/exploitation behavior in a sequential-decision making task. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005